ZIP and data document visualization
نویسندگان
چکیده
Text data modeling has been usually considered with Bernoulli or multinomial event models. Poisson distribution is considered inefficient for text information retrieval. In this work, we propose to incorporate the Zero Inflated Poisson model in the Generative Topographic Mapping algorithm. The modified algorithm is presented as a text document cluster extraction and visualization tool. Experimental results are presented for the Medlars, CISI and Cranfield collections, observing notable class separation.
منابع مشابه
دیداری کردن نتایج جستوجو در فرایند بازیابی اطلاعات
Purpose: One of the most effective ways to achieve optimum information retrieval is through visualization of Information. Search strategies, probing skills, querying of information needs and analysis of information play a significant role in the accessing of necessary and useful information. Besides the factors mentioned above, information visualization can increase the availability level of in...
متن کاملGeovisualization for Knowledge Construction and Decision Support____________________________________ Visualization Viewpoints Ieee Computer Graphics and Applications Integrating and Extending Perspectives Functions of Geovisualization
Published by the IEEE Computer Society 0272-1716/04/$20.00 © 2004 IEEE IEEE Computer Graphics and Applications 13 We now have access to vast digital data resources that include geospatial referencing.This referencing ranges from precise geographic coordinates, through street addresses, to codes for administrative or other types of regions (such as zip codes and drainage basin indices). GPS rece...
متن کاملVizCluster and Its Application on Clustering Gene Expression Data
Visualization enables us to find structures, features, patterns and relationships in a dataset by presenting the data in various graphical forms with possible interactions. A visualization can provide a qualitative overview of large and complex datasets, can summarize data, and can assist in identifying regions of interest and appropriate parameters focused on quantitative analysis. Recent deve...
متن کاملCreating a Local Geographic Influenza-like Illness Activity Report
Methods The data elements that were included in this study were: daily emergency department (ED) ILI rate, 7 day moving average ED ILI rate, zip code, and date from October 1, 2013 to March 31, 2014. The data region was the catchment area of an urban academic medical center (AMC). The data were processed using the GUARDIAN (Geographic Utilization of Artificial Intelligence in Real-Time for Dise...
متن کاملGeovisualization for Knowledge Construction and Decision Support____________________________________ Visualization Viewpoints Integrating and Extending Perspectives Functions of Geovisualization
0272-1716/04/$20.00 © 2004 IEEE Published by the IEEE Computer Society IEEE Computer Graphics and Applications 13 We now have access to vast digital data resources that include geospatial referencing.This referencing ranges from precise geographic coordinates, through street addresses, to codes for administrative or other types of regions (such as zip codes and drainage basin indices). GPS rece...
متن کامل